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AN a-GALACTOSIDASE ENZYME 
FIELD OF THE INVENTION 

5 

The present invention relates to a DNA construct encoding an 
a-galactosidase enzyme and variants thereof having a-galacto- 
sidase activity, a recombinant expression vector and a cell 
harbouring said DNA construct, and a method of preparing an 
10 a-galactosidase enzyme preparation by use of recombinant DNA 
techniques. The a-galactosidase enzyme encoded by the DNA 
construct of the invention may, inter alia , be used for the 
degradation of a-galactosides present in various plant prod- 
ucts, or as a digestive aid. 

15 

BACKGROUND OF THE INVENTION 

ot-galactosidase is a well-known enzyme involved in the hydro- 
20 lysis of a-galactosides present in, for instance, various im- 
portant plants or plant parts used for nutritional purposes 
such as legumes, vegetables, grains, cereals and the like, a- 
galactosidase enzymes are produced by various microorganisms, 
plants and animals. Mammals, however, are deficient in intes- 
25 tinal a-galactosidase production and, consequently, are inca- 
pable of decomposing ingested a-galactosides by themselves. 
Instead, ingested a-galactosides are decomposed by microorga- 
nisms present in the intestine. This microbial decomposition 
normally results in flatulence and further confers a diges- 
30 tive discomfort to the mammal upon ingestion of a-galactosi- 
de-containing food or feed. The physiological effects of a- 
galactosides are discussed in detail by Rackis, J. J., 1975. 

In order to overcome the problem associated with mammalian a- 
35 galactosidase deficiency, a-galactosides contained in food or 
feed have been modified prior to ingestion, for instance en- 
zymatically by the action of a-galactosidase. Alternatively, 
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a-galactosidase has been suggested as a digestive aid, cf. WO 
90/14101. 

The production of a-galactosidase has been reported from bac- 
5 teria, e.g. Bacillus stearothermophilus (US 3,846,239), 
yeasts, e.g. Saccharomyces cereviciae (US 4,431,737), fungi, 
e.g. strains of the genii Neurospora and Rhizopus (Worthing- 
ton and Beuchat, 1974), Aspergillus oryzae (Cruz and Park, 
1982), A. ficuum (morphologically similar A. niger) (Zapater 

10 et al., 1990) and A. niger (Bahl and Agrawal (1969 and 1972), 
Christakopoulos et al. (1990), Chun and Lee (1988), Jung and 
Lee (1986), Lee and Wacek (1970), Adya and Elbein (1976), Ka- 
neko et al. (1991)). All of these references, however, de- 
scribe the a-galactosidase production by conventional fermen- 

15 tation of naturally occurring or mutated microbial strains. 

Overbeeke et al., 1990, describes the production of a-galac- 
tosidase from guar in Bacillus subtilis and Aslandis et al, 
1989, describes an a-galactosidase from J?, coli. 

20 

An A. niger a-galactosidase enzyme preparation (Alpha-Gal™) 
produced by conventional fermentation is available from Novo 
Nordisk A/S, Denmark. One drawback associated with the pro- 
duction of a-galactosidase by fermentation of A. niger is 
25 that substantial amounts of oxalic acid, an undesired by-pro- 
duct, are produced by A. niger simultaneously with the pro- 
duction of a-galactosidase. 

It would be desirable to be able to produce an A. niger a-ga- 
30 lactosidase enzyme preparation with reduced or without simu- 
ltaneous production of oxalic acid, and further to increase 
the yield and the purity of the a-galactosidase preparation 
so produced. 

35 The object of the present invention is to device means and 
methods for the production of a-galactosidase enzymes by re- 
combinant DNA techniques. By use of such techniques it is 
contemplated to be possible to produce a-galactosidase in 
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substantially larger amounts and more economical than what is 
possible by use of conventional fermentation technology and 
at the same time avoid or reduce the amount of oxalic acid 
formed. 

5 

BRIEF DESCRIPTION OF THE INVENTION 

Accordingly, in a first aspect the present invention relates 
10 to a DNA construct comprising a DNA sequence encoding a poly- 
peptide having a-galactosidase activity, wherein the DNA se- 
quence a) encodes a polypeptide comprising the amino acid 
sequence shown in the appended SEQ ID No. 3, or b) is an ana- 
logue of the DNA sequence of a) , which 

15 

i) hybridizes with the DNA sequence shown in the appended SEQ 
ID No. 1 or 2 or an oligonucleotide probe prepared on the 
basis of said DNA sequence or on the basis of the amino acid 
sequence shown in SEQ ID No. 3 under the conditions defined 

20 below, and/or 

ii) encodes a polypeptide reactive with an antibody reacting 
with at least one epitope of a polypeptide comprising the a- 
mino acid sequence shown in the appended SEQ ID No. 3, and/or 

25 

iii) encodes a polypeptide being at least 50% identical with 
the polypeptide having the amino acid sequence shown in the 
appended SEQ ID No. 3. 

30 The nucleotide sequence shown in SEQ ID No. 1 illustrates an 
entire a-galactosidase gene (including introns) isolated and 
characterized from a strain of Aspergillus niger, and the nu- 
cleotide sequence shown in SEQ ID No. 2 is the corresponding 
cDNA sequence. The nucleotide sequences are further described 

35 in the examples hereinafter. The amino acid sequence shown in 
SEQ ID No. 3 is deduced from the DNA sequence shown in SEQ ID 
No. 2 and illustrates the amino acid sequence of the A. niger 
a-galactosidase enzyme including its signal peptide. 
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In a further aspect the present invention relates to a recom- 
binant expression vector harbouring the DNA construct of the 
invention and a cell which either harbours the DNA construct 
or the expression vector of the invention. 

5 

A still further aspect of the present invention is a process 
for the production of a polypeptide exhibiting a-galactosid- 
ase activity, which process comprises culturing a cell as de- 
scribed above harbouring a DNA construct of the invention in 
10 a suitable culture medium under conditions permitting expres- 
sion of the polypeptide, and recovering the resulting poly- 
peptide from the culture. 

The polypeptide exhibiting a-galactosidase activity may com- 
15 prise the amino acid sequence shown in SEQ ID No. 3. or be a 
variant thereof. The variant may be a naturally-occurring 
variant derived from any source or organism, and in particu- 
lar from a naturally-occurring microorganism or a mutant or 
derivative thereof. Furthermore, the "variant" may be a gen- 
20 etically engineered variant, e.g. prepared by suitably mod- 
ifying a DNA sequence of the invention resulting in the addi- 
tion of one or more amino acid residues to either or both the 
N- and C-terminal end of the polypeptide encoded by the unmo- 
dified DNA sequence, substitution of one or more amino acid 
25 residues at one or more different sites in the amino acid se- 
quence, deletion of one or more amino acid residues at either 
or both ends of the polypeptide or at one or more sites in 
the amino acid sequence, or insertion of one or more amino a- 
cid residues at one or more sites in the amino acid sequence. 

30 

By use of the process of the invention it is possible to pro- 
duce enzyme preparations having a higher content of cr-galac- 
tosidase than what is possible by conventional fermentation 
of a parent microorganism, such as A, niger, inherently pro- 
as ducing the a-galactosidase. Furthermore, the resulting a-ga- 
lactosidase preparations are essentially free from any other 
components derived from the parent microorganism, in particu- 
lar components giving rise to undesirable enzymatic side-ac- 
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tivities. Accordingly, by use of the process of the invention 
it is possible to optimize the production of a-galactosidase 
enzyme components thereby producing an enzyme preparation 
with a higher specific a-galactosidase activity at lower cost 
5 than what is possible by methods known in the art. At the 
same time the undesirable production of oxalic acid may be 
substantially reduced or avoided. 

10 DETAILED DISCLOSURE OF THE INVENTION 

In the DNA construct of the invention, the analogue of the 
DNA sequence encoding a polypeptide having a-galactosidase 
activity may, for instance, be a subsequence of said DNA se- 

15 quence, a genetically engineered modification of said sequen- 
ce which may be prepared by well-known procedures, e.g. by 
site-directed mutagenesis, and/or a DNA sequence isolated 
from another organism and encoding an a-galactosidase enzyme 
with substantial similarity to the a-galactosidase having the 

20 amino acid sequence shown in SEQ ID No. 3. The actual sequen- 
ce of the analogue is not critical as long as the analogue 
has at least one of the properties i)-iii) listed above. 
These properties are further discussed below. 

25 Property i) , i.e. the hybridization of a DNA sequence with 
the DNA sequence shown in the SEQ ID No. 1 or 2 or with a 
suitable oligonucleotide probe prepared on the basis of said 
DNA sequences or on the basis of the polypeptide shown in SEQ 
ID No. 3 may be carried out under any suitable conditions al- 

30 lowing the DNA sequences to hybridize. For instance, 1 fig of 
total DNA expected to harbour an analogous DNA sequence is 
subjected to complete digestion with, e.g. EcoRI, BamHI or 
Hindlll, and applied to a 1% agarose gel. The DNA fragments 
are separated by electrophoresis, and then transferred to an 

35 Immobilon™-N membrane (Millipore Corporation) following the 
Manufacturers instructions. The membrane is prehybridized 
following the manufacturers instructions and then the DNA 
sequence shown in SEQ ID No. 1 or 2 or a representative frag- 
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ment thereof, labelled with 22^ by primer extension (Sambrook 
et al., 1989), is added as a probe, and the temperature re- 
duced to 45^C. After 18 hrs of hybridization the membrane is 
washed repeatedly in 6xSSC, 0.1% SDS at 45«»C. The membrane is 
5 then subjected to autoradiography and evaluated. 

Property ii) , i.e. the immunological cross reactivity may be 
assayed using an antibody raised against or reactive with at 
least one epitope of the a-galactosidase enzyme comprising 

10 the amino acid sequence shown in SEQ ID No. .3. The antibody, 
which may either be monoclonal or polyclonal, may be produced 
by methods known in the art, e.g. as described by Hudson et 
al., 1989. The immunological cross-reactivity may be determi- 
ned using assays known in the art, examples of which are We- 

15 stern Blotting or radial immunodiffusion assay, e.g. as de- 
scribed by Hudson et al., 1989. 

Property iii) may be determined by comparing the amino acid 
sequences of the polypeptide encoded by the analogue and the 

20 polypeptide sequence shown in SEQ ID No. 3 by use of well- 
known algorithms, such as the one described by Lipman and 
Pearson (1985). In the present context, "identity" is used in 
its conventional meaning, i.e. intended to indicate the num- 
ber of identical amino acid residues occupying similar posi- 

25 tions in the two (or more) amino acid sequences to be com- 
pared. 

It is believed that an identity of above 50% such as above 
about 70%, 75%, 80%, 90% and in particular above about 95% 

30 with the amino acid sequence shown in SEQ ID No. 3 is indica- 
tive for homology with the a-galactosidase encoded by the DNA 
sequences shown in SEQ ID Nos. 1 and 2. From an alignment 
study of the amino acid sequence shown in SEQ ID No. 3 and 
the amino acid sequence encoding the E. coli a-galactosidase 

35 disclosed by Aslandis et al., 1989 an identity of about 30% 
was found. As far as the present inventors are aware this is 
the only a-galactosidase with a known amino acid sequence 

\ 
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that show any comparable identity to the a-galactosidase en- 
coded by the DNA construct of the invention. 

It is well known that homology exists between polypeptides of 
5 different origins, and a-galactosidases homologous to a-ga- 
lactosidases from yeast have been found in plants as well as 
in mammals. Analogously herewith, it is contemplated that in 
the DNA construct of the invention, the DNA sequences may be 
derived from an animal including a mammal and an insect, a 
10 plant or a microorganism. In the present context, especially 
interesting origins are bacteria and fungi. The term "fungi" 
is intended to include yeasts and filamentous fungi* 

As stated above, the DNA sequences shown in SEQ ID Nos. 1 and 
15 2 encoding an a-galactosidase are derived from a fungus, more 
particularly from A. niger. It is contemplated that other fun- 
gal Q-galactosidases may show a substantial homology, either 
on the DNA or amino acid level, with the A. niger a-galacto- 
sidase disclosed herein, and accordingly, DNA sequences of 
20 the DNA construct of the invention may be derived from a fun- 
gus, in particular from a strain of Aspergillus such as from 
a strain of A. niger. An example of such strain is the strain 
of A. niger deposited with the American Type Culture Collec- 
tion under the number ATCC 16882. 

25 

When isolated from A. niger the a-galactosidase enzyme is 
contemplated to exist as a number of isoenzymes, presumably 
due to heavy glycosylation. It is expected that the a-galac- 
tosidase encoded by the DNA construct of the invention may be 
30 in the form of different isoenzymes, depending on the circum- 
stances under which it is produced, and in particular on the 
host cell in question producing the enzyme. 

In Example 1 below characteristic properties are described of 
35 an A. niger a-galactosidase enzyme (as isolated from A. 
niger) . It has surprisingly been found that some properties 
of an a-galactosidase expressed from a DNA construct of the 
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invention differ from the corresponding properties of the a- 
galactosidase isolated from A. niger. 

Thus, whereas the isolated a-galactosidase has a pH optimum 
5 in the range of 3.8-6.0, the a-galactosidase expressed from 
the DNA sequence shown in SEQ ID No. 2 in an Aspergillus ory- 
zae host cell has been found to have a pH optimum in the ran- 
ge of 5.0-7.0 (cf. Example 5 herinaf ter) . 

10 Based on the corresponding properties of the purified A. nig- 
er a-galactosidase, it is contemplated that an a-galactosid- 
ase enzyme encoded by the a DNA construct of the invention 
has a pi in the range of 4.0-5.0 (depending on the isoenzyme 
in question) such as about 4.3 as determined by lEF as de- 

15 scribed herein, a temperature optimum within the range of 50- 
70*>C, a molecular weight of about 170 kDa, and/or a specific 
activity of above about 250 GALU/mg protein. 1 GALU is the 
unit of a-galactosidase strength which is further defined in 
the materials and methods section below. 

20 

It will be understood that the preferred DNA construct of the 
invention is one, in which the DNA sequence is as shown in 
the appended SEQ ID No. 1 or 2 . 

25 The DNA sequence of the DNA construct of the invention may be 
isolated by well-known methods. Thus, the DNA sequence may, 
for instance, be isolated by establishing a cDNA or genomic 
library from an organism expected to harbour the sequence, 
e.g. a cell of any of the origins mentioned above, and scree- 

30 ning for positive clones by conventional procedures. Examples 
of such procedures are hybridization to oligonucleotide pro- 
bes synthesized on the basis of the full or partial amino 
acid sequence of the A. niger a-galactosidase comprising the 
amino acid sequence shown in SEQ ID No. 3 in accordance with 

35 standard techniques (cf. Sambrook et al., 1989), and/or se- 
lection for clones expressing an appropriate biological acti- 
vity as defined above, and/or selection for clones producing 
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a protein which is reactive with an antibody raised against 
the A. niger a-galactosidase. 

A preferred method of isolating a DNA construct of the inven- 
5 tion from a cDNA or genomic library is by use of polymerase 
chain reaction (PCR) using degenerate oligonucleotide probes 
prepared on the basis of the amino acid sequence shown in SEQ 
ID No. 3. For instance, the PCR may be carried out using the 
techniques described in US Patent No. 4,683,202 or by R.K. 
10 Saiki et al. (1988). 

Alternatively, the DNA sequence of the DNA construct of the 
invention may be prepared synthetically by established stan- 
dard methods, e.g. the phosphoamidite method . described by 
15 Beaucage and Caruthers (1981), or the method described by 
Matthes et al. (1984). According to the phosphoamidite 
method, oligonucleotides are synthesized, e.g. in an automa- 
tic DNA synthesizer, purified, annealed, ligated and cloned 
in appropriate vectors. 

20 

Finally, the DNA construct may be of mixed genomic and syn- 
thetic, mixed synthetic and CDNA or mixed genomic and CDNA 
origin prepared by ligating fragments of synthetic, genomic 
or CDNA origin (as appropriate) , the fragments corresponding 
25 to various parts of the entire recombinant DNA molecule, in 
accordance with standard techniques. 

As stated above, the DNA construct of the invention may also 
comprise a genetically modified DNA sequence. Such sequence 

30 may be prepared on the basis of a genomic or cDNA sequence of 
the invention, suitably modified at a site corresponding to 
the site(s) of the polypeptide at which it is desired to in- 
troduce amino acid substitutions, e.g. by site-directed mut- 
agenesis using synthetic oligonucleotides encoding the de- 

35 sired amino acid sequence for homologous recombination in 
accordance with well-known procedures, or by use of random 
mutagenesis, e.g. through radiation or chemical treatment. 



wo 94/23022 



PCT/DK94/00138 



10 

Examples of suitable modifications of the DNA sequence are 
nucleotide substitutions which do not give rise to another 
amino acid sequence of the polypeptide, but which may corre- 
spond to the codon usage of the host organism into which the 
5 recombinant DNA molecule is introduced (i.e. modifications 
which, when expressed, results in e.g. an a-galactosidase 
comprising the amino acid sequence shown in the appended SEQ 
ID No- 3), or nucleotide substitutions which do give rise to 
a different amino acid sequence and. therefore, possibly, a 

10 different polypeptide structure without, however, impairing 
properties of the polypeptide such as enzymatic properties 
thereof. Other examples of possible modifications are inser- 
tion of one or more nucleotides into the sequence, addition 
of one or more nucleotides at either end of the sequence and 

15 deletion of one or more nucleotides at either end of or with- 
in the sequence. 

The recombinant expression vector carrying the DNA construct 
of the invention may be any vector which may conveniently be 

20 subjected to recombinant DNA procedures, and the choice of 
vector will often depend on the host cell into which it is to 
be introduced. Thus, the vector may be an autonomously repli- 
cating vector, i.e. a vector which exists as an extrachromo- 
somal entity, the replication of which is independent of 

25 chromosomal replication, e.g. a plasmid or a bacteriophage. 
Alternatively, the vector may be one which, when introduced 
into a host cell, is integrated into the host cell genome and 
replicated together with the chromosome (s) into which it has 
been integrated. 

30 

In the vector, the DNA sequence should be operably connected 
to a suitable promoter sequence. The promoter may be any DNA 
sequence which shows transcriptional activity in the host 
cell of choice and may be derived from genes encoding prot- 
35 eins either homologous or heterologous to the host cell. For 
instance, examples of suitable promoters for directing the 
transcription of the DNA construct of the invention in a 
fungal host cell are the TAKA promoter and the triose phos- 



wo 94/23022 



PCT/DK94/00138 



11 

pha-te isoinerase promoter of Aspergillus oryzae, the amylogly- 
cosidase promoter and the glyceraldehyde-3-phosphate dehydro- 
genase promoter of Aspergillus niger and the cellobiohydrola- 
se I promoter of Trichoderme reseei. 

5 

The expression vector of the invention may also comprise a 
suitable terminator operably connected to the DNA construct 
of the invention. The terminator is suitably derived from the 
same source as the promoter of choice* < 

10 

The vector may further comprise a DNA sequence enabling the 
vector to replicate in the host cell in question. Examples of 
such sequences are the origins of replication of plasmids 
pUC19, pACyC177, pUBllO, pE194 , pAMBl and pIJ702, 

15 

The vector may also comprise a selectable marker, e.g. a gene 
the product of which complements a defect in the host cell, 
such as the dal genes from B.subtilis or B. lichenif ormis , or 
one which confers antibiotic resistance such as ampicillin, 
20 kanamycin, chloramphenicol or tetracyclin resistance. 

While intracellular expression may be advantageous in some 
respects, e.g. when using certain bacteria as host cells, it 
is generally preferred that the expression is extracellular. 
25 In order to obtain extracellular expression, the expression 
vector should normally further comprise a DNA sequence encod- 
ing a preregion, i.e. a signal peptide, permitting secretion 
of the expressed a-galactosidase or a variant thereof into 
the culture medium. 

30 

The procedures used to ligate the DNA construct of the inven- 
tion, the promoter, terminator and other elements, respect- 
ively, and to insert them into suitable vectors containing 
the information necessary for replication, are well known to 
35 persons skilled in the art (cf., for instance, Sambrook et 
al. (1989)). 
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The cell of the invention either comprising a DNA construct 
or an expression vector of the invention as defined above is 
advantageously used as a host cell in the recombinant produc- 
tion of a polypeptide of the invention. The cell may be tran- 
5 sformed with the DNA construct of the invention, conveniently 
by integrating the DNA construct in the host chromosome. This 
integration is generally considered to be an advantage as the 
DNA sequence is more likely to be stably maintained in the 
cell. Integration of the DNA constructs into the host chromo- 
10 some may be performed according to conventional methods, e.g. 
by homologous recombination. Alternatively, the cell may be 
transformed with an expression vector as described below in 
connection with the different types of host cells. 

15 The cell of the invention may be a cell of a higher organism 
such as a mammal or an insect, but is preferably a microbial 
cell, e.g. a bacterial or a fungal (including yeast) cell 
which, on cultivation, produces large amounts of the poly- 
peptide. 

20 

Examples of suitable bacteria are grampositive bacteria such 
as Bacillus subtilis . Bacillus lichenif ormis , Bacillus lent- 
us . Bacillus brevis . Bacillus stearothermoohilus , Bacillus 
alkalophilus . Bacillus amyloliquef aciens . Bacillus coagulajns, 
25 Bacillus circulans . Bacillus lautus. Bacillus thurinaiensis 
or Streptomvces liyidans, Streotomvces murinus, or gramnega- 
tive bacteria such as E.coli . The transformation of the bac- 
teria may for instance be effected by protoplast transforma- 
tion or by using competent cells in a manner known per se . 

30 

The yeast organism may favourably be selected from a species 
of Saccharomvces or Schizosaccharomvces . e.g. Saccharomyces 
cerevisiae . The filamentous fungus may advantageously belong 
to a species of Aspergillus , e.g. Aspergillus orvzae or 
35 Aspergillus niger. Fungal cells may be transformed by a pro- 
cess involving protoplast formation and transformation of the 
protoplasts followed by regeneration of the cell wall in a 
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manner known per se. The use of Aspergillus as a host organ- 
ism is described in, e.g-/ EP 238 023. 

In a yet further aspect, the present invention relates to a 
5 method of producing a polypeptide of the invention, which 
method comprises cultivating a host cell as described above 
under conditions conducive to the production of the polypep- 
tide and recovering the polypeptide from the cells and/or 
culture medium. 

10 

The medium used to cultivate the cells may be any convention- 
al medium suitable for growing the host cell in question. 
Suitable media are available from commercial suppliers or may 
be prepared according to published recipes (e.g. in catalo- 
15 gues of the American Type Culture Collection) . 

The polypeptide may be recovered from the medium by conven- 
tional procedures including separating the cells from the me- 
dium by centrif ugation or filtration, if necessary after dis- 

20 ruption of the cells, precipitating the proteinaceous compo- 
nents of the supernatant or filtrate by means of a salt, e.g. 
ammonium sulphate, followed by purification by a variety of 
chromatographic procedures, e.g. ion exchange chromatography, 
affinity chromatography, or the like, the actual recovery 

25 method being dependant on the kind of polypeptide in question 
as well as the desired final purity thereof. 

Depending on the degree of purification of the polypeptide 
produced by the process of the invention, the resulting po- 

30 lypeptide preparation may contain minor amounts of other en- 
zymatic components inherently produced by the host cell used 
for the production. For instance, when a fungal cell, such as 
one of the genus Aspergillus, is used as a host cell for the 
production of a recombinant fungal a-galactosidase enzyme, 

35 certain of the enzymatic side-activities normally found in a- 
galactosidase preparations produced by conventional fermenta- 
tion of a parent fungal strain may also be produced and re- 
covered together with the recombinant polypeptide produced in 
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accordance with the present invention. An example of an enzy- 
me normally found in a-galactosidase preparations prepared by 
conventional techniques is the enzyme invertase. This enzyme 
is inherently produced by a number of Aspergillus strains and 
5 consequently may also be found in a-galactosidase prepara- 
tions produced by Aspergillus strains in accordance with the 
present invention, although in a considerably lower amount as 
compared to the a-galactosidase than what is observed in con- 
ventional fermentation. Thus, in the context of the present 
10 invention, a substantial increase in the ratio of a-galacto- 
sidase to other enzymatic activities may be obtained in addi- 
tion to the increased total yield of a-galactosidase. 

If it is desired to produce substantially pure a-galactosi- 
15 dase or alternatively a-galactosidase preparation free from 
certain undesired enzymatic side-activities (an example of 
which - for some uses of the a-galactosidase - is invertase) 
one may either remove the side-activity (ies) by purification 
or one may choose a production organism incapable of produc- 
20 ing the side-activity (ies) in question. 

The a-galactosidase encoded by the DNA construct of the in- 
vention may be used for a number of purposes involving hydro- 
lysis of a-galactosides to galactoses and sucroses. 

25 

For instance the a-galactosidase preparation encoded by the 
DNA construct and produced by the process of the invention 
may . be used for the hydrolysis of a-galactosides present in, 
e.g., plants or plant parts which, for instance, are intended 

30 for nutrition of mammals or for fermentation of microorgan- 
isms. As indicated above, such plants and plant parts com- 
prise legumes such as peas and beans, nuts, seeds, grains, 
cereals and vegetables including potatoes, beets and chicory, 
as well as processed products thereof including flour, meal, 

35 bran, molasses, etc. Thus, the a-galactosidase enzyme pre- 
pared according to the invention may be used for the pre- 
treatment of food or feed containing a-galactosides and for 
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modification of soy bean or sugar beet molasses used as a 
substrate in the fermentation of microorganisms. 

One important use of the a-galactosidase preparation prepared 
5 according to the invention is in the modification of soy 
beans or soy products such as soy bean molasses, soy bean 
sauce, soy bean milk, and soy bean whey. 

Accordingly, in a further aspect the present invention re^ 

10 lates to a method of preparing an enzyme-modified soy bean 
product comprising subjecting a composition containing the 
soy bean product to be modified to enzymatic treatment in the 
presence of an a-galactosidase preparation produced according 
to the invention. The enzymatic treatment may be performed by 

15 use of methods known in the art. For instance, soy bean meal 
may be modified by suspending the soy bean product in water 
so as to obtain a dry matter content in the resulting suspen- 
sion of about 15-20%, adjusting pH to about 4.5-6 and trea- 
ting the resulting suspension with 0.5% of an a-galactosidase 

20 preparation of the invention comprising about 500 GALU/g for 
2-8 hours at 50*>C. The resulting modified product may subse- 
quently be spraydried. Furthermore, soy bean products may be 
produced as described by Olsen et al., 1981 and Eriksen, 
1983, and the a-galactosidase preparation may be added, when 

25 appropriate, during the production. In the preparation of soy 
milk the a-galactosidase preparations may be added to the ex- 
tract resulting after separation of solid particles from the 
soy bean material or during evaporation or in the final con- 
centrated soy milk product. 

30 

Alternatively, a soy bean product may be treated by a method 
comprising 

a) inserting a DNA construct of the invention encoding an 
35 a-galactosidase, optionally present in a suitable express- 
ion vector, into a suitable host organism. 
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b) culturing the host organism in a suitable culture 
medium under conditions permitting expression of the 
polypeptide encoded by the DNA construct, and recovering 
the resulting polypeptide from the culture, and 

5 

c) subjecting a composition containing the soy bean pro- 
duct to be modified to enzymatic treatment in the presence 
of the polypeptide recovered in step b) . 

.10 Step a) and b) may be performed as disclosed herein. 

The a-galactosidase preparation produced according to the 
invention may further be used for the production of sugar 
from sugar beets in accordance with well-known procedures to 
15 improve the sugar yield by hydrolysing raffinose and stacchy- 
ose to galactose or sucrose* 

Another important use of the a-galactosidase prepared accord- 
ing to the invention is for the in vivo conversion of a-ga- 
20 lactoside-linked sugars in mammals, e.g. as described in WO 
90/14101. 

The a-galactosidase preparation may thus be used as digestive 
aid. For this purpose the a-galactosidase preparation may be 
25 combined with a suitable carrier or excipient so as to be in 
the form of a tablet, a capsule, a powder, a liquid, or in a 
soft-gel capsule form. The amount of a-galactosidase present 
in such formulations is in the range of 500-20000 GALU/G. 

30 In a further aspect the present invention relates to a food 
or feed comprising an a-galactosidase preparation prepared 
according to the invention. The a-galactosidase preparation 
is typically included in an amount corresponding to about 1- 
20 GALU/g of food or feed. Examples of food or feed in which 

35 the a-galactosidase preparation may be included is given 
above . 



t 
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The present invention is described in the following by refer- 
ence to the appended drawings, in which 

Fig. 1 illustrates the construction of pCaHj 413 as described 
5 in Example 4, 

Fig. 2 illustrates the construction of pCaHj 414 as described 
in Example 4, 

Fig. 3 illustrates the construction of pCaHj 424 as described 
in Example 4, 

10 Fig. 4 illustrates the pH optimum of a-galactosidase. 

Fig. 5 is a HPLC chroma togram illustrating the degradation of 
raffinose by a-galactosidase, and 

Fig. 6 is a HPLC chromatogram illustrating the degradation of 
stacchyose by a-galactosidase. 

15 

The present invention is further illustrated in the following 
examples, which are not^ in any manner, intended to limit the 
invention as disclosed herein. 

20 

MATERIALS AND METHODS 
Starting material 

The a-galactosidase preparation used in the following 
25 examples is a commercial A. niger a-galactosidase preparation 
(Alpha-Gal™, Batch KAN 0001) available from Novo Nordisk A/S, 
Denmark • 

Determination of a-6alactosidase Activity (6ALU) 
30 1 GALU is defined as the amount of a-galactosidase required 
for hydrolyzing 1 ^mole p-nitrophenyl a-D-galactopyranoside 
(to p-nitro phenol + galactose) in one minute under the fol- 
lowing conditions: 
Substrate: 0.80 mM p-NPGal 

35 pH: 5.5 - acetate buffer b.0333M 

Temperature : 3 7 C 
Reaction time: 15 min. 
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Reagents : 

!• BUFFER: Acetate buffer 0.05 M, pH 5.5 

2. SUBSTRATE: 1.2 mM p-Nitrophenyl-o-D-galactopyranoside 

3. STOP REAGENT: Borax - NaOH buffer 0.0625 M, pH 9.7 
5 4. COLOUR STANDARD: 4-Nitrophenol , 240 ftM 

Procedure 

A standard curve is prepared by mixing 2 ml of substrate and 
1 ml of various dilutions of colour standard (prepared with 
10 demineralized water) and adding 5 ml of stop reagent. When 
making the colour standard blank use demineralized water 
instead of colour standard. Measure OD405. 

Weigh and dilute the enzyme preparation to a concentration 
15 corresponding to an activity of about 0.0015 GALU/ml. 





Sample 


Sample blank 


Sample 


1 ml 


1 ml 


Preheat substrate 
for 5 minutes 


37*»C 




Add substrate 
(stop watch) 
and mix 


2 ml 




Incubation for 
15 minutes 


37«*C 


room temp. 


Add stop reagent 
and mix 


5 ml 


5 ml 


Substrate - 
room temperature 




2 ml 


Measure OD405 
within 30 minutes 







Calculation of Activity: 

Make the colour standard curve (aOD against concentration) . 
35 The activity is calculated according to the following for- 
mula: 



40 Act 



(As - Ab) • Fs • 10-^ 
T • M 
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where 



As = 



M = 



T = 



The reading on the standard curve in /xM 4-NP, corre- 
sponding to OD405 for the sample. 

The reading on the standard curve in /xM 4-NP, corre- 
sponding to OD405 for the sample blank. 
Dilution factor for the sample. 
Reaction time in minutes (= 15) . 
Amount of sample weighed out. 



10 10-^ 



Conversion factor 1/ml. 



Fed batch fermentation 

Fed batch fermentation was performed in a medium comprising 
maltodextrin as a carbon source, urea as a nitrogen source 

15 and yeast extract. The fed batch fermentation was performed 
by innoculating a shake flask culture of A. oryzae host cells 
in question into a medium comprising 3.5% of the carbon 
source and 0.5% of the nitrogen source. After 24 hours of 
cultivation at pH 7.0 and 34^C the continuous supply of addi- 

20 tional carbon and nitrogen sources were initiated. The carbon 
source was kept as the limiting factor and it was secured 
that oxygen was present in excess. The fed batch cultivation 
was continued for 4 days, after which the enzyme could be 
recovered by centrifugation, ultrafiltration, clear filtra- 

25 tion and germ filtration. 

Characterization of an enzyme of the invention 

pH optimum is measured by using 2mM PNP-a-galactosidase in 

0.1 M citrate/phosphate buffers pH 2.5-10 as a substrate. To 

30 0.5 mi substrate is added 10 /xl enzyme solution (lOOx diluted 
in 3 mg/ml BSA) , the mixture is incubated at 30*»C for 15 min- 
utes and the enzyme is heat-inactivated at 95**C. Three 
samples and one blank are prepared. 100 fil are pipetted into 
a microtiter plate well, 100 /xl IM tris buffer pH 7.0 are 

35 added and the absorbance is measured in the microtiter reader 
at 405 nm. Paranitrophenol is used as a standard. The speci- 
fic activity at the optimal pH is calculated. 
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Temperature stability is measured by leaving the enzyme sol- 
ution (in BSA or in 0.25% raffinose) at different tempera- 
tures for 1 and 2 hours before incubations are carried out at 
optimal pH in PNP-a-galactoside. Measurements are carried out 
5 as above. 

Specific activity towards raffinose is measured by carrying 
out incubations at optimal pH at different raffinose concen- 
trations (2--32 mM) . Released galactose is determined by the 
10 amount of reducing sugars. 

Reducing sugars are determined by reaction, in microtiter 
plates, with a PHBAH reagent comprising 0.15 g of para 
hydroxy benzoic acid hydrazide (Sigma H-9882), 0.50 g of po- 
15 tassium-sodium tartrate (Merck 8087) and 2% NaOH solution up 
to 10.0 ml. Results of blanks are subtracted. 

In order to test for activity towards raffinose and stacchy- 
ose with and without the presence of galactose and sucrose, 

20 solutions are mixed according to the table below. The buffer 
is 0.1 M acetate buffer at the optimal pH for each enzyme. 10 
fil of enzyme solution (diluted 10 times) is added and incuba- 
tions are carried out at 30 °C for 0, 1, 2, 4 and 2 4 hours. 25 
/xl of the supernatant is analysed on the Dionex HPLC system 

25 (PAl column, 0.12 M NaOH eluent, 1 ml/min flow rate. Pulsed 
Amperometric Detection) which separates all the saccharides. 
This experiment should also reveal if any transferase activ- 
ity can be ascribed to the a-galactosidases. , 

30 Experiment 

raff. 1% stach. 4% sucr. 10% gal. 1% buffer 



ftl 111 fil ill 

1. 200 800 

2. 200 800 
35 3. 200 800 

4. 200 200 600 

5. 200 200 600 

6. 200 200 600 

7. 200 200 200 400 



40 
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EXAMPLE 1 

Purification and characterization of a-»galactosidase from 
Aspergillus niger 

5 

Salt precipitation 

A sample of the a-galactosidase preparation was washed with 5 
volumes of ionwater in an Amicon-UF-cell (membrane GR 60PP, 
Cut Off 25.000). Salt precipitation was achieved by use of 
10 (NH4)2S04 at 60% saturation (385 g/1) , at which degree of sat- 
uration a-galactosidase had been shown to precipitate. The 
(NH4)2S04 was added slowly (more than one hour) under stirring 
at room temperature. The pH was kept constant at pH 5.5 by 
addition of a base. 

15 

The precipitate was redissolved in water and washed in an 
Amicon-UF-cell (membrane GR 60 PP) until a conductance of 
about 0.9 mS was reached.. 

20 lonexchanae 

The redissolved and washed precipitate was subjected to an- 
ionexchange on a DEAE-Sepharose-CL-6B column equilibrated 
with a citrate/phosphate buffer, pH 5.5 (0.002 M citric 
acid/ 0.006 M Na2HP04) , and a conductivity of about 0.9 mS. The 
25 a-galactosidase was eluted with 0-0.5 M NaCl and fractions 
containing a-galactosidase activity were pooled. 

Gelfiltration 

The pooled a-galactosidase fractions were concentrated 10 x 
30 to obtain a protein content of about 16 mg/ml. The gelfiltra- 
tion was performed on a Sephadex GlOO (Mw 4.000-150-000) gel- 
filtration column equilibrated with the buffer specified 
above - 

35 The a-galactosidase which was present in the front fraction 
and contained 5.6 mg of protein, was subsequently analysed 
for purity by use of the lEF Phast system and the SDS-PAGE 
Phast system as described below. 
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The specific activity of the front fraction was determed to 
264 GALU/mg Protein as described above. The protein content 
was determined spectrophotometrically at 280 nm. 

5 lEF 

The a-galactosidase fraction was run on an lEF-PAA pH 4-6.5 
(Pharmacia Phast System File Nos. 100 and 200). A strong bond 
could be observed at pH 4.3 and a weak shadow was observed at 
pH 4.2. It was concluded that the pi of the purified enzyme 
10 was 4.3. 

SDS-PAGE 

The a-galactosidase fraction was run on a SDS-gradient gel 
PAA 10-15 (Pharmacia Phast System as above) . Before the 

15 sample was loaded the protein was subjected to denaturation 
and reduction by boiling and addition of DTT (1, 4-Dithio-DL- 
threitol) . A strong bond was observed at Mw 90.000 and a 
shadow at Mw 100.000. When no boiling with DTT was performed 
the SDS-analysis resulted in a bond at Mw 170.000 indicating 

20 the molecular weight of the intact protein. The fact that the 
Mw of the intact protein is 170.000 is in accordance with the 
fact that the a-galactosidase was contained in the front 
fraction obtained from the gelf iltration analysis, in that 
the Mw of proteins contained in the front fraction would be 

25 expected to be higher than 150.000. 

It can thus be concluded that the a-galactosidase enzyme from 
A. niger described herein is a dimer of two protein chains 
each having a molecular weight of about 90.000. 

30 

EXAMPLE 2 

Preparation and characterization of a-galactosidase peptides 

Chemical degradation of a purified a-galactosidase prepara- 
35 tion with surplus CNBr was carried out in 70% HCOOH for 24 h 
at 25'*C. Enzymatic degradation using chymotrypsin was carried 
out in 0.05 M NH4HCO3, 2 M urea for 5 h at 37**C at an enzyme: 
substrate ratio of 1:40 (w:w) . Peptides were purified using 
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microbore reversed phase HPLC employing either C4 or C18 col- 
umns eluted with linear gradients of 75% aqueous 2-propanol 
in 0.1% aqueous TFA (trif louroacetic acid). Purified peptides 
were sequenced using an Applied Biosystems 473A protein 
5 sequencer . 

The following two peptides were obtained from chemical degra- 
dation with CNBr: 

10 CKBr-peptide 1: 

Gly-Ala-His-Leu-Ser-Ala-Val-Pro-Asn-Ala~Gln-Thr-Gly-Arg-Thr- 
Val-Pro-Ile-Thr-Phe-Arg-Ala-His-Val- (SEQ ID No. 4) 

CMBr-peptide 2: 

15 Asp-Asp-Gly-Trp-Phe-Gly-Asp-Lys-Tyr-Pro-Arg-Val-Ser-Asp-Asn- 
Ala-Gly-Leu-Gly-Asp-Asp- (SEQ ID No. 5) 

The following peptides were obtained from the enzymatic de- 
gradation using chymotrypsin: 

20 

Cbymotrypsin-peptide 1: 

Thr-Thr-Arg-Phe-Pro-Asp-Val-Leu-Trp (SEQ ID No. 6) 

Cbymotrypsin-peptide 2: 
25 Thr-Ser-Asp-Asn-Thr-Asp-Ala-'Ile-Asp-Arg-Ile-Thr-Ile-Gln-Phe 
(SEQ ID No. 7) 

Chymotrypsin-peptide 3: ^ 

Arg-Leu-Arg-Leu-Pro-Gln-Asp-Ser-Gln-Trp-Pro-Ala-Ala-Leu-Phe 
30 (SEQ ID No. 8) 

Cbymotrypsin-peptide 4 : 

Gly-Leu-Glu-Leu-Asp-Pro-Ala-Thr-Val-Glu-Gly-Asp-Glu-Ile-Val- 
Pro-Glu-Leu (SEQ ID No. 9) 

35 

Chymotrypsin-peptide 5: 

Val-Met-Asp-Asp"Gly-Trp-Phe-Gly-Asp-Lys-Tyr-Pro-Arg-Val-Ser- 
Asp-Asn-Ala-Gly- (SEQ ID No. 10) 
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It may be noted that amino acid residues 3-19 of the 
chymotrypsin-peptide 5 are present in CNBr-peptide 2 (amino 
acid residues 1-17) . 

5 EXAMPLE 3 

Cloning of an Aspergillus niger a-galactosidase 

Generation of an a-aalactosidase probe 

As noted in Example 2 above chymotrypsin-peptide 5 and CNBr- 
10 peptide 2 are overlapping. Together they reveal the peptide: 

VMDDGWFGDKYPRVSDNAGLGDD (SEQ ID No. 11) 

Polymerase chain reaction (PCR) primers were designed in 
15 order to amplify the DNA sequence encoding this peptide 
sequence. 

In the 5' end (sense strand) the following degenerate primer 
was used: 

20 

5/ TTACTAGTNATGGAYGAYGGNTGGTT 3' (5'#1: 64 species) (SEQ ID 
No. 12). 

A Spe I site (ACTAGT) was anchored in the 5' end of this 
25 primer. 

In the 3' end (sense strand) the following degenerate primers 
were used: 

5' TTGAGCTCRTCNCCYATU^CCNGCRTT 3' (3'#1: 512 species) 

30 (SEQ ID No. 13) . 

5' TTGAGCTCRTCNCCNAGNCCNGCGTT 3' (3 '#2: 512 species) 

(SEQ ID No. 14) 

35 5' TTGAGCTCRTCNCCNAGNCCNGCATT 3' (3 '#3: 512 species) 

(SEQ ID No. 15) 

5'#1 was used together with either 3'#1, 3 '#2 or 3 '#3. 
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Genomic DNA was prepared from A. niger (ATCC 16882) as 
described by Leach et. al*, 1986, 

This DNA was used as template in the PGR reactions (0.05 fig 
5 genomic DNA, 100 pmol of each degenerate primer, 200 ftM of 
dATP, dCTP, dGTP and dTTP, 1.5 mM MgClj, 50 mM KCl, 0.01% ge- 
latine, 10 mM Tris-HCl pH 8.3 in a total volume of 100 fil) , 
and the following PGR program was run: 

10 94^C for 2 min., 1 cycle (0.5 /il of amplitaq' tag polymerase 
(Perkin Elmer - Cetus) was added during this incubation) . 

94*C for 1 min., 50*>C for 1 min., 72*^0 for 2 min., 30 cycles. 

15 for 5 min., 1 cycle. 

The products of the PGR amplifications were concentrated and 
run on an agarose gel* In the amplifications employing 3'#1 
and 3 '#3 no product except for 'primer dimer' was seen, but 
in the amplification employing 3 '#2 a distinct fragment of 

20 approx. 80 bp. was seen. This fragment was isolated, digested 
with the restriction enzymes Spel and Sad and ligated to the 
vector pUC19 (Yanish-Perron et al., 1985) digested with Xbal 
and Sad. The ligation mixture was transformed into Escheri-^ 
chia coli MG 1000 (Gasadaban et al., 1980) made r"m* by con- 

25 ventional methods. 

Plasmid DNA isolated from a transformant was sequenced using 
the Sequenase kit (United States Biochemicals) following the 
manufacturers instructions. The sequence showed that the do- 
30 ned PGR fragment actually encoded the peptide fragment de- 
scribed above. The insert (86bp) of this plasmid was used as 
a probe in order to clone the a-galactosidase gene. 

Labelling of the probe 
35 A radioactive labelled probe was prepared in the following 
way: 5 ^g of the plasmid was digested with EcoRI and Sail and 
the 86 bp fragment was isolated from an agarose gel and dis- 
solved in 20 ^1 water. This was used as a template in a PGR 
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reaction including 2 ^1 of the fragment, 50 pmol primer 5'fl, 
50 pmol primer 3'#2, 10 pmol a^^PdATP (3000 Ci/mmol) (DuPont 
NEG-012H), 10 pmol dTTP, 10 pmol dCTP, 10 pmol dGTP, 1.5 mM 
MgClj, 50 mM KCl, 0.01% gelatine, 10 mM Tris-HCl pH 8.3 in a 
5 total volume of 100 /xl. 

The following temperature cycling program was run: 

94*'C for 2 min., 1 cycle (0.5 |xl of amplitaq' tag polymerase 
10 (Perkin Elmer - Cetus) was added during this incubation) . 

94^0 for 1 min., 50*»C for 1 min., 72^C for 2 min., 30 cycles. 

72^C for 5 min., 1 cycle. 

15 

The labelled fragment was isolated using a Sephadex G5p spun 
column as described by Maniatis et. al. (Maniatis et al., 
1982). The probe was heat denatured for 5 min, 100**C, and 
then added to the hybridization mixture. 

20 

Genome cloning of the g-galactosidase 

Genomic DNA from A. niger was prepared as described above, 
and .digested with various restriction enzymes, and the diges- 
tions were used for Southern blot analysis using the 
25 described a-galactosidase probe. 

A 4.5 kb BamHI fragment hybridized to the probe. This frag- 
ment was cloned the in the following way: 

30 A. niger genomic DNA was digested with BamHI, and fragments 
of 4-5 kb were isolated from an agarose gel. The fragments 
were ligated to pUC19 digested with BamHI and 
dephosphorylated with calf intestine alcaline phosphatase. 
The ligation mixture used to transform coli using 

35 ampicillin selection. 5000 clones were screened for the 4.5 
kb a-galactosidase fragment by colony hybridization using 
the described a-galactosidase probe, and hybridizing clones 
were selected. 
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Sequence analysis using the primers 3710 and 3711 of plasmid 
DNA isolated from one of these clones confirmed that the 
cloned fragment contained an a-galactosidase encoding 
sequence. This plasmid was termed pCaHj409* Sequence deduced 
5 from the M13 universal primer (United States Biochemicals) 
revealed that the 3' end of the gene was missing. 

2178 bp of the insert covering the cloned part of the a-ga- 
lactosidase gene was sequenced from both strands using vari- 
10 ous primers. 

3710 5' GCGTTATCGGACACTCG 3' (SEQ ID No. 16) 

3711 5' GTTTGGGGACAAGTACC 3' (SEQ ID No. 17) 

15 cDNA cloning bv PGR 

mRNA was prepared by guanidinium thiocyanate extraction fol- 
lowed by centrifugation in cesium chloride solution as 
described by Sambrook et. al^ 1989, using fresh mycelium. 

20 First strand cDNA was synthesized from an oligo dT primer 
using the BRL superscript cDNA kit following the manufactu- 
rers instructions. 

The CDNA gene was cloned as a 5' fragment and a 3' fragment 
25 using the rapid amplification of cDNA ends (RACE) method as 
described by Frohman, 1990. 

The primer 3710 was used as a secjuence specific primer for 
amplification of the 5' end, and 3711 was used as a sequence 
30 specific primer for amplification of the 3' end. In both 
cases the primers 2010 and 4433 were used as hybrid oligo dT 
primer and adaptor primer, respectively. 

2010 5 * CTGCAGTCGACTCTAGAGGATCCGCGGCCGCTTTTTTTTTTTT 
35 TTTTTTTTTTTT 3* (SEQ ID No. 18) 

4433 5* TTACTGCAGTCGACTCTAGAGGATCCGCG 3' (SEQ ID No. 19) 
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Composition of PGR reaction mixtures and the cycling profiles 
were as described by Frohman, op cit . 

The obtained 430 bp 5' fragment was digested with BaxnHI and 
5 Xhol and ligated to pUC19 digested with BamHI and Sail. The 
ligation mixture was transformed into E. coli using ampicil- 
lin selection. A plasmid containing an insert was sequenced 
from both strands using various primers. The sequence con- 
firmed that the fragment was an a-galactosidase cDNA frag- 
10 roent. 

The obtained 1300 bp 3' fragment was digested with Xhol and 
Xbal and ligated to pUC 19 digested with Sal I and Xba I. The 
ligation mixture was transformed into E. coli using ampicil- 
15 lin selection. A 1300 bp insert from a plasmid was confirmed 
to be an a-galactosidase fragment by sequence analysis from 
both strands using various primers. This plasmid was termed 
pCaHj 410. 

20 The genomic sequence and the cDNA sequence are shown in SEQ 
ID Nos. 1 and 2, respectively. The nucleotide fragments 302- 
371, 628-716, 978-1032 of the genomic sequence represent in- 
tron sequences. 

25 The a-galactosidase protein sequence showed about 30% 
homology to the E. coli a-galactosidase encoded by the gene 
rafA (Aslandis et al., 1989). 

30 EXAMPLE 4 

Expression of the a-galactosidase 

Construction of a-qalactosidase expression vectors 
The plasmid pCaHj 409 was digested with Sal I and Pst I, and 
35 a 1.5 kb fragment was isolated and ligated to pUC 19 digested 
with Sal I and Pst I. After transformation into E. coli and 
isolation of plasmid, the resulting plasmid was digested with 
Sal I and EcoR I, and the 4.2 kb fragment was isolated. pCaHj 
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410 was digested with EcoR I and Sal I, and the 0.8 kb frag- 
ment was isolated and inserted into to the 4*2 kb fragment 
described above. The resulting plasiaid was termed pCaHj 412. 
This plasmid was digested with ApaL I, the 3' recessed ends 
5 were filled in using the Klenow polymerase, and after phe- 
nol/chloroform extraction the mixture was digested with Hind 
III. The resulting 2.2 kb fragment was isolated. 

The Aspergillus expression plasmid pToC 68 (described in WO 
10 91/17243) was digested with Bgl II, the 3* recessed ends were 
filled in using the Klenow polymerase, and after phenol/chlo- 
roform extraction the mixture was digested with Hind III. The 
4.6 kb fragment was isolated and ligated to the 2.2 kb frag- 
ment described above. The resulting plasmid, termed pCaHj 
15 413, contained a part of the aglN gene fused to the termina- 
tor of the amyloglycosidase gene of A. niger (Tamg) . The con- 
struction of pCaHj 413 is summarized in Fig. 1. 

pCaHj 413 was digested with Hind III and Xho I, and the 4.1 
20 kb fragment was isolated. pCaHj 409 was digested with Hind 
III and Xho I, and the 4.0 kb fragment containing the 5* end 
of the aglN gene was isolated and ligated to the pCaHj 413 
fragment. The resulting expression plasmid, termed pCaHj 414, 
contained the aglN promoter followed by the aglN gene fused 
25 to the AMG terminator- The construction of pCaHj 414 is sum- 
marized in Fig. 2. 

pMT 1560 (4169 bp) was derived from pHD 414 (described in WO 
92/16634) by replacing the 617 bp BamH I - EcoR I fragment of 
30 pHD 414 with the BamH I - EcoR I digested PCR fragment ob- 
tained from a PCR reaction using pHD 414 as a template and 
the primers: 

5 ' GCTCCTCATGGTGGATCCCCAGTTGTGTATATAGAGGATTGAGGAAGGAAGAGAAGTG- 
35 TGGATAGAGGTAAATTGAGTTGGAAACTCCAAGCATGGCATCCCTTGC 3' 106 mer 
(SEQ ID No. 20) , and 

5'TGTTCTGGCTGTGGTGTACAGG 3' 22mer (SEQ ID No. 21). 
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pMT 1560 was digested with Nco I and Hind III, and the 3.9 kb 
fragment was isolated. pCaHj 414 was digested with Nco I and 
Hind III, and the 5.2 kb fragment containing the aglN gene 
was isolated and inserted into the 3.9 kb pMT 1560 fragment. 
5 The resulting plasmid was termed pCaHj 419. This plasmid was 
digested with Hind III and Xho I and the 5.2 kb containing 
the TAKA promotor of A. oryzae and the 3' end of the agIN 
gene fused to the AMG terminator was isolated. 

10 pCaHj 414 was used as a PCR template together with the 
primers 3710 and 4982 (containing a Hind III site followed by 
the ATG start codon of the aglN gene) : 

3710 5' GCGTTATCGGACACTCG 3* (SEQ ID No. 16) 

15 

4982 5* GCAAGCTTTATCATCACCACCATGAT 3* (SEQ ID No. 22) 

The PCR conditions were as described in Example 3 above (in 
'^Generation of an a-galactosidase probe"). The PCR fragment 

20 was digested with Hind III and Xho I and inserted into the 
5.2 kb pCaHj 419 fragment. The resulting expression plasmid 
was termed pCaHj 424 and contained the aglN gene fused to the 
TAKA promotor in the 5' end and to the AMG terminator in the 
3' end. The construction of pCaHj 424 is summarized in figure 

25 3. 

Transformation of A. oryzae 

The plasmid pCaHj 414 was transformed into Aspergillus ory- 
zae IFO 4177 using selection on acetamide by cotransf ormation 
30 with pToC 90 as described in WO 91/17243. 

By cultivation in shake flasks or in submerged tank fermenta- 
tion of the cotransformants activity was accumulated in the 
broth . 

35 

pCaHj 424 was transformed into A. oryzae IFO 4177 using the 
same method. Cotransformants expressed significantly higher 
amounts of a-galactosidase than pCaHj 414 transf ormants. 
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Purification of g-aalactosidase 

The culture supernatant from fermentation of Aspergillus or- 
yzae expressing the recombinant enzyme is centrifuged and 
filtered through a 0.2^im filter to remove the roycelia. 
5 35-50 ml of the filtered supernatant (30-60 mg a-galactosi- 
dase) are ultraf iltrated in a Filtron ultracette or Amicon 
ultrafiltration device with a 10 kDa membrane to achieve 10 
fold concentration. This concentrate is diluted 100 times in 
25 mM Tris pH 8.0 in two successive rounds of ultrafiltration 

10 in the same device. This ultraf iltrated sample is loaded at 
1.5 ml/min on a Pharmacia HR16/20 Fast Flow Q Sepharose anion 
exchanger equilibrated in 25 mM Tris pH 8.0. After the sample 
has been applied, the column is washed with two column vol- 
umes 25 mM Tris pH 8*0, and bound proteins are eluted with a 

15 linear increasing NaCl gradient from 0 to 0-6 M NaCl in 25 mM 
Tris pH 8.0. a-galactosidase elutes at approximately 0.25-0.3 
M NaCl, but the enzyme in this fraction is not completely 
pure (approximately 80% purity) . Thus, the a-galactosidase 
containing fractions were concentrated by ultrafiltration in 

20 Amicon ultrafiltration device with a 10 kDa membrane to a 
volume of 4.5 ml and applied to a HR 26/60 Sephacryl S200 
gelf iltration column in 0.25 M amonium acetate pH 5.5 at a 
constant flow of 1 ml/min. a-galactosidase is eluted as one 
distinct peak with a purity of approximately 90%. In order to 

25 achieve material purified to electrophoretic homogeneity, the 
a-galactosidase containing fractions are pooled, and ultra- 
filtrated into 10 mM sodium phosphate pH 6.8. The sample is 
applied onto a 8 ml BioRad HTP hydroxyl apatite column (10 mm 
internal diameter) at a constant flow rate of 1 ml/min. Bound 

30 enzymes are eluted by increasing the sodium phosphate concen- 
tration from 10 mM to 0.2 M over 40 min. a-galactosidase 
elutes at approximately 0.1 M sodium phosphate, and is more 
than 95% pure in this fraction. 

35 
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EXAMPLE 5 

Characterization of a-galactosidase 

5 The following properties of the a-galactosidase expressed in 
and purified from A. oryzaB were determined by the methods 
described in the Materials and Methods section above. 

The results obtained can be summarized in the following 
10 table: 

Mw 95 kDa 

pH-optimum 6.0 

15 

Stability in water very stable 

temperature stability < 60**C 
in BSA for 1 hour 

20 

temperature stability < 70®C 
in presence of raf f inose 

specific activity towards 
25 (/imol/mg enzyroe/min) 

a) PNP-a-galactosidase 90 

b) raff inose 14 5 (100) 

c) stacchyose (350) 
30 d) guar gum (0) 

inhibition by galactose No 

transferase activity No 

35 

Results in brackets are calculated from the HPLC results. 



pH optimum 

The pH optimum which is seen in Fig. 4 shows that the enzyme 
40 is most active at pH 6 but retains some activity in the whole 
range from pH 4-8. This is surprising in that the enzyme iso- 
lated from A. niger has a pH optimum in the range of 4-6. 



45 
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Degradation of stacchvose and raffinose and HPLC analysis 
From the HPLC chromatograms in Fig. 5 and 6 it is seen that 
degradation of raffinose (peak 4) is completed within 24 
hours the reaction products being sucrose (peak 39, galactose 
5 (peak 1) and small amounts of fructose (peak 2) • The degrada- 
tion of stacchyose results in the formation of raffinose 
(peak 4), sucrose (peak 39 and galactose (peak 1). After 24 
hours all stacchyose and raffinose has been converted into 
sucrose, galactose and small amounts of fructose. 

10 

It was surprisingly found that the enzyme was not inhibited 
by galactose. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 
5 (A) NAME: NOVO NORDISK A/S 

(B) STREET: Novo Alle 

(C) CITY: Bagsvaerd 

(E) COUNTRY: DENMARK 

(F) POSTAL CODE (ZIP) ; DK-2880 
10 (G) TELEPHONE: +45 44448888 

(H) TELEFAX: +45 4449 3256 

(I) TELEX: 37304 

(ii) TITLE OF INVENTION: A- niger alpha-galactosidase 
15 (iii) NUMBER OF SEQUENCES: 22 
(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPmWlTING SYSTEM: PC-DOS /MS-DOS 

20 (D) SOFTWARE: Patentin Release #1,0, Version #1.25 

(EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 2476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
30 (iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus niger 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GmsTcccPG ca^asrroGG gaaigagaag toggochgcc aaocxxsgagt ggggg^vtgat go 

GCOCN3C3^ AAACrOGAXA CXXHOXIAIG mUJUUUGA 1CX343TCCfG AODGTOCXSGG 120 
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GA33^AAA3GC OSCTGAGAGG AAGAACIGCT GCITCACICA OCTCOGAGAT GIUTUiGGAT 180 

TCAOQGCCAG CAKEATCA3C AOCAOCftlGA TOOGPCTIOC CMGCnGIGG TQOCnGGQOC 240 

TnTEAOSIT AmEGTCftT TCTOCAGACA 03CO0QCMC TOQGGnTTCA AAOCET^CftGA 300 

GmUOGMnG OOGCOOCC3CA OGAIGCTACT OCAAOGCITG AICriCATlC GCfGJMGCT 360 

5 GAOCAAAC?^ CGAICGTEAC GAA3GGC30* AGTITOCEAT TCAAQQGOGA CAAIGICICA 420 

TMC3GA3TOC AICTCAACAG TADC2m3QC GACTIGMTr CIGATCATIT TOGIGGIGIC 480 

GICICXX3QCA CAAiaXTIC GOCAGIGGAA (TIGCIGrCA ADQGCIGGGT CXaOCMGOCr 540 

GGTOGAATOC GOOGOGAGTr OOOOGACTAA OGCXCTGQQG AmoaQCAT CaXtXTCTT 600 

ajEATTOGOG PJmX30C3^ TrATACiUiT A3CXMCrOC MMt^IUItJG CAOGAQGIGA 660 

10 TOGAAGCTAA AAAIQCITIG aXOQQOCIG OCIGOCACOT TIGGOGAIGC GCAGGCICTC 720 

ACAACmGG TAGTOGATCT CTATCACAAC TKEAGCTOOS TOGOGGOO^ CITCTCATAC 780 

TOCATATTTC OGAAA3MGA TCOGATOCTS AGGAGnGTCA AIGIGATCAA OCAGGGOXA 840 

GGEAA3MCA COMOGAGGC CmGCAAGC AaJAAGEAIOG AITIOCXnA TCAAGAOCIC 900 

GACAIGGICA GCXHOOGAGG CCAdGGGOC AGAGAGGC7A /m?ITCAGAG AAGCAAAGOG 960 

15 CAGrMGOOG TOCRGGQSIA NSKSMdOWi GCKCAAAAOC GACATOGTCA CXOTDCIGAC 1020 

GQGAGAGE^ MTOQGAAGC AGEACTOGAT MTOCTCrCA OCTTCT^^^ OULTIOJI'IU 1080 

OCT^XAGTAG^ TCXT^GCXACT iMDCCAAIOSC AAGGCCAGGC AIGGQGmC AAQ^ITCTAT 1140 

POfiOOOGCrC 'mUlUUGCC CAAOB^GAGA AAOGATOGCA AQ LgmCA OC GQGGOGCIOC 1200 

TOGGCTTCAA OOOQGAOCM TDmXJIGGA AOCTTOOCXC TOGOGAGACT TEAAOTIOCX: 1260 

20 OOGAGIGICT TGCAGICTAC TQQGACAAAG GOCTITOGCIC AGnCTCTOGC AAAITCrAOC 1320 

GGCIAIATOG CAADCADCTC MGAAGAGCA AGTrOOOCAC GTOOGADOGG CXI3GITCIGC 1380 

TCAAaaGCIG GGAAGGrEGIT TATITOGACT ACAATCAAAG CAGCATOGAA ACTCTIX300G 1440 

AAGAGTOOGC TGOOCIGQCT GTOCACXTCr TIGTCAIGGA OGAOGGCIGG TTIGGQGACA 1500 

AGTAOOCTOG ASIUIOOGAT AAC3G000GAC TGGGCXSACTG GAIGOOCAAT OCMCDGOGCT 1560 

25 TOCXDQGADQG CTTCADOOOG GTOGIGCAAG ACMCACAAA TCTCftDanC AAIGGCACAG 1620 

AGICEACAAA ALTlOGCriT GCTATTrOGG TGGAGOOCXSA GATOGTCAAC OXAAITOCA 1680 

CrciCEAOCA OGAACAOOOG GAGTOOGOGC TTCATOCm; GCXTEACXXX: CGTIAaDGAGC 1740 

GKDGAADCA CCIXXnUJrC AAOdOGOQC TTOaOGCICT GCAGGACTIC ATCKEAGACT 1800 

TCAIGAOGAA CXJIGriACAA GATAOOGGCA TTFOCrrAOGr CT^AfflGGGAC AACAAODGGG 1860 

30 GAAJEACAOGA GAOGOCXTICT OOGTOGACIG AOCATCAGEA CAIGCITOGC CICEAOOQOG 1920 

TCITOGACAC ACIGACX3V0C CXXTTOOOGG AIGIOCIGIG GGAAGGATCT GOCTOGGGTC 1980 

GAGGCX3GCIT TGA3GCIGGC AIGCIGCAGr AIGiaXXrA GATCIQ3\Cr TOOGAIAACA 2040 

CXEAOGOCAT OGAOOGAATC AOCATOCAAT TIGGGAOCTC GdTGCrTAC OaSCXMCAG 2100 

CAA3GGGIGC OCAOCIUrCX: GOGGTKXICA ACX3CACAGAC OGGTOGGACT GIGOOCMTA 2160 

35 CmOUGOGC ACADGTIGCr AIGATOGGIG GITCma3G CTIGGAGCIG GAIOOQGOGA 2220 

OGGIGGAAGG GGAOGAAAIA GITOOOGAGC TKTIGOQCr OGOQGAAAAA GIGAACXXnA 2280 

TCMTnGAA CGGAGATCIG TATOOQCIAC GOOAOCICA ASACKXXAG TQGCTTGCAG 2340 

CAClCmur GACTCAGGAT GQOQCACAGG CIGTICIUIT CTACITCAGG TOCAGOCEAA 2400 
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TGrrCAAOCAT GOQGCXnGGG TCTVSGCIGCr GOOSTUQGAC CIAMSGCSGAC TTm^OnTS^ 2460 
TGGAGAa?^ CKTOOG , 2476 

(2) INFORMATION FOR SEQ ID NO: 2: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2028 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
10 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

15 (A) ORGANISM: A. niger 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

MEMDQCTC TKXXSmSCr GTOGIGCCIG GOOdTFITA OnTMACXSG TC3flTCIGCA 60 

GACAOQOCOG CAACIGGGGT TTCAAAOXA CPCMXXTOG TIA0GAAIG6 CACD^GITIC 120 

OGAITCAAOG GOGACAAIGT CTCNTNiaSi TIOCMCTCA ACAGEAOCaC OOGOGACnG 180 

20 ATITCIGATC ATTTIGGriGG TGrTCGICroC GGCACAMOC CTTCOCCNjr GGAAOCIGCT 240 

GTCAMGGCr GGGnOQGCAT GOCTOGrTOGA ATOC3GO0QGG AGTKXrDGA CCAW3G0CCT 300 

GGOGftTTroC GCKiraXGC OCTIOSIMT OSGGAATOGG CM3GnMAC TSCnGICACA 360 

AdTIGGiaG TC3CMCIGTA TGACAACEAT AGCTOOGPOG CQGOOSAdT GTCaTACIOC 420 

ATAinoaGA AAIKDSATGC GATCGTGAGG ACSGTCAAIG TGATC3^A0CA GGGaXTVGGT 480 

25 AMMCACEA TOGW3GO0CT TQCAAGCATA AGEMOGMT TOOCXnMGA AGAOnDGAC 540 

2aasrCM30C TOCGAGGOSA CIGGOOCAGA GAGGCAAAIG TTCT^GAGAAG CAAAGIGCAG 600 

TKiaXCTCC AQOGftTTOGG AflGCAGIACr GGKECTTCCT CTCftOCTICA CAAIOOOTIt; 660 

CriGOCATAG IXKTOCNjC mnJACDGAA TCXXAfiGGOG AGQCaTOOGG TTTCAAOdT 720 

GEAIAXAOCG GCICmCIC GGOOCAAOIA GAGAAAGG^ aOCAAGGITT CACXXGQQOS 780 

30 CIGCroQGCr TCAACXn3GA CXTNATEAiaS T3GAADCnG GOOCTQGOGA GAdTEAACT 840 

TCXXXXEAGT GICTIGCAGr CEACKX3GAC AAAGGCXTIG GCrC2^GrGTC TOGGAAATIC 900 

CAOOOGCEAT ATOGGAADCA OCTCATCAAG AGCAAGnOG OCAOG?rOOGA COQGCXI^CTT 960 

CIQCIGAATA GdGGGAAGG 'lUiTEAlTIC GACIT^CAATC AAAGCAGCAT CGAAACICIT 1020 

GOCXSAAGAGE COGCIGOOCT GGGTCTOCAC CIUITICTCA TGGAOGAOSG CIGGITIGQG 1080 

35 GACAAGTEAOC CZOGAGIGIC OGATAAOGOC GGAdGOOOG AdGGKIGOC CAAIXXAGOS 1140 

OQLTIGUXG AOOQGnGAC COaOGianG CAAGACATCA CAA?aCTCAC OSTCAAIOGC 1200 

ACaGAGTOCA CAAAACITCi; CITIGO'IMT TQGGIGGAGC CCX^AGAIGGT CAAOXXS^ 1260 

TCxacTcrcr acxssogaaca cooqgagigg gogctkmg oooGGOcnA axxxxs m x: 1320 
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GAGOGTOOGA AOCftGCrCCT OCTCAAOCIG GOGCTIXXGG CICTGCftGGA CITCMCKIA 1380 

GACITC?m]A CXy^AOCrcnT ACAAGATftCX: GGCftlTKCT AOSTCAAAaG GGACAACAAC 1440 

aSOOGATOaC AOGftGAOGCr CIL'IUJC'ICG ACTSAOCaTC AGIMaaGCT TOiXXTCTftC 1500 

OGGG?IGriT0S ACftCRCDSftC eaOXULTIC CXX3GMCT0C TgDOGGAftOG MUiUOLTUG 1560 

5 GGIOGftGQOC GCTnCMGC TOGCATOCIG CMIMCTOC CXXMMCIG GACITCXXaT 1620 

AAOmSACG OCMOGftDCG AMCACEMC CAMTITOOGA OCTOQCITOC CmXXDGCXA 1680 

TCftGCAMGG GIGCXXftOCT CIOCDCXSGriT OCTAAOSCAC ASAOOQGIOG CACTSIGOOC 1740 

APiAcrnoc GcrxACT^asr tgcekigmg GGrocsrrciT tooqctigga GdGG^mxG isoo 

G0GA09GI0G AMQOGADGA AftaMTIiax: GfiGOTUEIG (DGCIGGCOGA AAAAGIGAAC 1860 

10 CEOMCSaTr TGAAOQGAGA TCICTMa3G CIADGOCIAC CICAAGACTC CO^SIGGOCT 1920 

GC3^GCaCICr TICTGACTCA GGKIQGOQCA C7«3CICTIC TCnrciACIT CS^GGia^GC 1980 

CEAMGICAA OCSaGOOGOG TOGGnOGQC IGCIGGGGIT GGAOCEAA 2028 

(2) INFORMATION FOR SEQ ID NO: 3: 
15 (i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 676 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(ii) 
(iii) 
(iii) 

(V) 

(vi) 



(D) TOPOLOGY: linear 
MOLECULE TYPE: protein 



HYPOTHETICAL: NO 



ANTI-SENSE: NO 



FRAGMENT TYPE: internal 



ORIGINAL SOURCE: 



25 



(xi) 



(A) ORGANISM: A* niger 

SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Met lie Gly Leu Pro Met Lai Tcp cys Lai Gly Lai Fte Bir Lai lyr 
15 10 15 



30 



Gly His Ser Ala Asp Unr Pro Ala Uir Gly Val Ser Asn Pro Gin Ihr 
20 25 30 



35 



He Val Ihr Asn Gly Thr Ser Hie Arg Lai Asn Gly Asp Asn Val Ser 
35 40 45 



lyr Arg Fhe His Val Asn Ser Ihr Ihr Gly A^ Leu He Ser Aqp His 
50 55 60 
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Phe Gly Gly Val Val Ser Gly Ihr He Pro Ser Pro Val Glu Pro Ala 
65 70 75 80 



Val Asn Gly Trp Val Gly Met Pro Gly Arg lie Arg Arg Glu Hie Pro 
85 90 95 

Gin Gly Arg Gly A^ Phe Arg lie Pro Ala Val Arg lie Arg Glu 
100 105 110 

Ser Ala Gly Bir Ala Val Ohr Bur Leu Val Val His Leu Hyr Asp 
115 120 125 

Asn Tyr Ser Ser Val Ala Ala Asp Leu Ser lyr Ser lie Fhe Pro Lys 
130 135 140 

lyr Aqp Ala lie Val Arg Ser Val Asn Val lie Asn Gin Gly Pro Gly 
145 150 155 160 

Asn He Ihr He Glu Ala Leu Ala Ser He Ser He A^ Phe Pro Tyr 
165 170 175 

Glu Asp Leu Asp Met Val Ser Leu Arg Gly A^ Trp Ala Arg Glu Ala 
180 185 190 

Asn Val Gin Arg Ser Lys Val Gin lyr Gly Val Gin Gly Hie Gly Ser 
195 200 205 

Ser Ihr Gly Tyr Ser Ser His Leu His Asn Pro Phe Leu Ala He Val 
210 215 220 

Aqp Pro Ala Ohr Ihr Glu Ser Gin Gly Glu Ala Trp Gly Phe Asn Leu 
225 230 235 240 

Val Tyr Ihr Gly Ser Kie Ser Ala Gin Val Glu Lys Gly Ser Gin Gly 
245 250 255 

Phe Ihr Arg Ala Leu Leu Gly Phe Asn Pro Asp Gin Leu Ser Trp Asn 
260 265 270 

Leu Gly Pro Gly Glu Ihr Laa Ihr Ser Pro Glu cys Val Ala Val Tyr 
275 280 285 

Ser A^ Lys Gly Leu Gly Ser Val Ser Arg Lys Ihe His Arg Leu Tyr 
290 295 300 

Arg Asn His Leu Met Lys Ser Lys Fhe Ala Ihr Ser Asp Arg Pro Val 
305 310 , 315 320 

Leu Leu Asn Ser Trp Glu Gly Val Tyr Ihe A^ Tyr. Asn Gin Ser Ser 
325 330 335 

lie Glu Thr Leu Ala Glu Glu Ser Ala Ala Lai Gly Val His Leu Phe 
340 345 350 

Val Met Aqp Asp Gly Trp Hie Gly Asp Lys lyr Pro Arg Val Ser PiSp 
355 360 365 



Asn Ala Gly Leu Gly Trp Met Pro Asn Pro Ala Arg Leu Pro Asp 
370 375 380 
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Gly JJEU Uttc Pro Val Val Gin fisp He Hvc Asn Leu Thr Val Asn Gly 
385 390 395 400 

Thr Glu Ser Ohr lys Leu Arg Phe Gly He Trp Val Glu Pro Glu Met 
405 410 415 

Val Asn Pro Asn Ser The Leu Tyr His Glu His Pro Glu Trp Ala Leu 
420 425 430 

His Ala Gly Pro Tyr Pro Arg Hir Glu Arg Arg Asn Gin Leu Val Leu 
435 440 445 

Asn Leu TOa Lai Pro Ala Val Gin Aq? Phe He He Asp Jhe Met Ihr 
450 455 460 

Asn Leu Leu Gin Asp Ihr Gly He Ser lyr Val Lys Trp Asp Asn Asn 
465 470 475 480 

Arg Gly He His Glu Ihr Pro Ser Pro Ser Ihr A^ His Gin Ty^ Meit 
485 490 495 

Leu Gly Leu Tyr Arg Val Phe Aq? nhr Leu Ihr Ihr Arg Phe Pro Asp 
500 505 510 

Val Leu Trp Glu Gly cys Ala Ser Gly Gly Gly Arg Phe Asp Ala Gly 
515 520 525 

Met Leu Gin Tyr Val Pro Gin He Tcp Bir Ser A^ Asn Ihr Asp Ala 
530 535 540 

He A^ Arg He Thr He Gin Phe Gly Ihr Ser Leu Ala lyr Pro Pro 
545 550 555 560 

Ser Ala Met Gly Ala His Laa Ser Ala Val Pro Asn Ala Gin Ihr Gly 
565 570 575 

Arg Ihr Val Pro He Ihr Phe Arg Ala His Val Ala Met Met Gly Gly 
580 585 590 

Ser Phe Gly Leu Glu Leu Asp Pro Ala Ihr Val Glu Gly A^ Glu He 
595 600 605 

Val Pro Glu Leu Leu Ala Leu Ala Glu Lys Val Asn Pro He He Lai 
610 615 620 

Asn Gly A^ Leu lyr Arg Lai Arg Lai Pro Gin Asp Ser Gin Trp P>ro 
625 630 635 640 

Ala Ala Leu Phe Val Ihr Gin Asp Gly Ala Gin Ala Val Lai Phe Tyc 
645 650 655 

Phe Arg Ser Ser Arg Met Ser Ihr Met Arg Arg Gly Ser Gly cys Trp 
660 665 670 



Gly Trp Ihr Glx 
675 



wo 94/23022 



PCT/DK94/00138 



43 



(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
10 (V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



15 



Gly Ala His Leu Ser Ala Val Pro Asn Ala Gin Ihr Gly Arg Ihr Val 
1 5 10 15 

Pro lie Ihr Fhe Arg Ala His Val 
20 



(2) INFORMATION FOR SEQ ID NO: 5: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
25 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Asp Gly Trp Phe Gly Aq? Lys oyr Pro Arg Val Ser A^ Asn Ala 
15 10 15 

Gly leu Gly Asp A^ 

35 20 

(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(ii) 



(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 



(iii) 
(iii) 

(V) 



ANTI -SENSE: NO 



HYPOTHETICAL: NO 



FRAGMENT TYPE: internal 



(xi) 

Thr 
1 



SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Thr Arg Phe Pro Asp Val Leu Trp 

5 



10 



15 



20 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ihr Ser Asp Asn Hir Asp Ala lie A^ Arg lie Thr lie Gin Fhe 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Arg Leu Arg Leu Pro Gin Ser Gin Trp Pro Ala Ala Leu Phe 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
10 (V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Gly Leu Glu Lai P^sp Pro Ala Thr Val Glu Gly A^ Glu He Val Pro 
15 10 15 



15 



Glu Leu 



(2) INFORMATION FOR SEQ ID NO: 10: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
25 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Met Asp Asp Gly Trp Ite Gly Asp lys lyr Pro Arg Val Ser Asp 
15 10 15 



35 



Asn Ala Gly 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Val Met Asp Asp Gly Ttp Phe Gly Asp Lys Tyr Pro Arg Ser Aqp 
15 10 15 

Asn Ala Gly Leu Gly Asp Asp 
20 



(2) INFORMATION FOR SEQ ID NO: 12: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: synthetic DNA primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTACEAGINA TQGAYGAYGG NTQGIT 26 

30 (2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic DNA primer 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
5 TTCMCICOT CNOCW^OC NGC3^ 26 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
15 (iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic DNA primer 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TiGAQcraa' a^ccmsicc ngostt 26 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
30 (iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic primer 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TIGftGCrcKr CNOCHfiO^OC NQGATT 26 
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(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
10 (V) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
15 COSTIKIOOG AC7O0G 17 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
25 (iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic primer 

3D (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GmOOOGAC AAGIAOC 17 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CIGCftGICJGA CPCEAGAOGA TOOQOGGOCG dTITITnT TTITITITIT TTITr 55 

10 

(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
20 (V) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
25 TIAdGC?^ OGACrCEAGA OGATOOGOG 29 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
' (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
35 (iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic primer 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GCICCKMG GTOGATOCXX; AOTICTCTOT MftGftGGMT GMGAAGGAA GAGAAGIGIG 60 
GMWaOGDV AKTIGfiGrnG GAAftCIOC3A GCMGGC2flC OCITQC 106 

5 (2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

•it5rix:ia3cr CTOGrcoTOt gg 22 

20 (2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic primer 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GCAftGCmA TCATCAOCAC CAIGAT 26 
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CLAIHS 

1. A DNA construct comprising a DNA sequence encoding a poly- 
5 peptide having a-galactosidase activity, wherein the DNA se- 
quence a) encodes a polypeptide comprising the amino acid 
sequence shown in the appended SEQ ID No. 3, or b) is an ana- 
logue of the DNA sequence of a) , which 

10 i) hybridiEes with the DNA sequence shown in the appended SEQ 
ID No. 1 or 2 or an oligonucleotide probe prepared on the 
basis of said DNA sequence or on the basis of the amino acid 
sequence shown in SEQ ID No. 3 under the conditions defined 
herein, 

15 

ii) encodes a polypeptide reactive with an antibody reacting 
with at least one epitope of a polypeptide comprising the a- 
mino acid sequence shown in the appended SEQ ID No. 3, and/or 

20 iii) encodes a polypeptide being at least 50% identical with 
the polypeptide having the amino acid sequence shown in the 
appended SEQ ID No. 3. 

2. A DNA construct according to claim 1, wherein the DNA se- 
25 quence is derived from a microorganism, a plant or a mammal. 

3. A DNA construct according to claim 1 or 2, wherein the DNA 
sequence is derived from a bacterium or a fungus. 

30 4. A DNA construct according to any of claims 1-3, wherein 
the DNA sequence is derived from a strain of Aspergillus , 
especially from a strain of A. niger . 

5. A DNA construct according to any of the preceding claims, 
35 in which the DNA sequence encodes an a-galactosidase having a 
pi in the range of 4.0-5.5 as determined by lEF as described 
herein, a pH optimum in the range of 5.0-7.0 determined under 
the conditions described herein, a temperature optimum within 
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of about 170.000 Da, and/or a specific activity of above 
about 250 GALU/mg protein^ 

6. A DNA construct according to any of claims 1-5, wherein 
5 the DNA sequence is as shown in the appended SEQ ID No, 1 or 

2. 

7. A DNA construct according to any of the preceding claims 
in which the DNA sequence is a cDNA sequence, a genomic DNA 

10 sequence or a synthetic DNA sequence or a mixed cDNA, genomic 
and/or synthetic DNA sequence. 

8. A recombinant expression vector comprising a DNA construct 
according to any of claims 1-7. 

15 

9. A cell comprising a DNA construct according to any of 
claims 1-7 or a vector according to claim 8. 

10. A cell according to claim 9, which is a microbial cell. 

20 

11. A cell according to claim 10 which is a bacterial cell, a 
yeast cell, or a fungal cell. 

12. A cell according to claim 11, in which the bacterial cell 
25 is a cell of a gram-positive bacterium such as Bacillus or 

Streptomvces or a cell of a gram-negative bacterium such as 
Escherichia , the yeast cell is a cell of Saccharomvces . and 
the fungal cell is a cell of Aspergillus ^ 

30 13. An a-galactosidase preparation encoded by a DNA construct 
according to any of claims 1-7. 

14. A process for producing an a-galactosidase enzyme or a 
variant thereof exhibiting a-galactosidase activity, compris- 
35 ing culturing a cell according to any of claims 9-12 in a 
suitable culture medium under conditions permitting express- 
ion of the a-galactosidase enzyme or the variant, and reco- 
vering the resulting enzyme or variant from the culture. 
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15. Use of an a-galactosidase preparation as claimed in claim 
13 for the hydrolysis of an a-galactoside to galactosies and 
sucroses. 

5 16. The use according to claim 15, in which the a-galactoside 
is present in composition prepared from legumes, nuts, seeds, 
grains, cereals or vegetables. 

17. The use according to claim 16 for the in vivo conversion 
10 of a-galactoside-linked sugars in mammals. 

18. The use according to claim 17 for pre-treatment of food 
or feed containing a-galactosides. 

15 19. Use of the a-galactosidase preparation according to claim 
13 as an digestive aid. 

20. A food or feed comprising an a-galactosidase preparation 
according to claim 13. 

20 

21. A method of preparing an enzyme--modif ied soy bean product 
comprising subjecting a composition containing the soy bean 
product to be modified to enzymatic treatment in the presence 
of an a-galactosidase preparation according to claim 13. 

25 

22. A method of converting a soy bean product comprises 

a) inserting a DNA construct according to any of claims 
1-7, optionally present in a suitable expression vector, 
into a suitable host organism, 

30 

b) culturing the host organism in a suitable culture me- 
dium under conditions permitting expression of the poly- 
peptide encoded by the DNA construct, and recovering the 
resulting polypeptide from the culture, and 

35 

c) subjecting a composition containing the soy bean pro- 
duct to be modified to enzymatic treatment in the pres- 
ence of the polypeptide recovered in step b) . 
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